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1 Introduction 

Blind identification of an unknown code from the observation of noisy code¬ 
words. We address here a problem related to cryptanalysis and data security 
where an observer wants to extract information from a noisy data stream 
where the error correcting code which is used is unknown. Basically an ob¬ 
server has here several noisy codewords originating from an unknown code 
and wants to recover the unknown code and decode it in order to recover 
the whole information contained in these codewords. Generally this prob¬ 
lem is solved by making assumptions on the code which is used in this sce¬ 
nario (convolutional code, LDPC code, turbo-code, concatenated code, etc.). 
This is called the code reconstruction problem or blind identification of a 
code problem in the literature. This problem arises for instance in a non- 
cooperative context where observing a binary sequence originating from an 
unknown communication system naturally leads to such a problem, for more 
details see the introduction of [I]. It also arises in the design of cognitive re¬ 
ceivers which are able to cope with a great variety of error correcting codes p] 
or in the study of DNA sequences when looking for possible error correct¬ 
ing codes in the genetic code [2j. This problem has a long history: it has 
been addressed for a variety of codes, linear codes !M, cyclic codes MD], 
LDPC codes [5JE], convolutional codes [T UTT1 12E] . turbo-codes [27H37], BCH 
codes [551 - BT] . Reed-Solomon codes jTTI ITT] and Reed-Muller codes |T51J . 

We focus here on the problem of reconstructing an unknown code when 
an interleaver is applied after encoding. Recall that this is a commonly used 
technique to correct burst errors since it provides some sort of time diversity 
in the coded sequence. For instance when the code is a convolutional code, 
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this allows to spread the burst errors in remote locations and convolutional 
decoding performs much better. This problem has been already addressed 
in a series of papers [^6Tt5Q] . All these papers assume that the interleaver is 
structured (a convolutional interleaver [46l 149] or an helical scan interleaver 
[50]). It should be said here that the methods used in these papers make 
heavily use of the particular structure of the interleaved that is considered 
and some of these methods are highly sensitive to noise nam! or do not 
apply when there is noise [ 161H8] . 

In this paper, we will focus on the case where the code is a convolutional 
code and where the interleaver is a block interleaver. This pair is used in 
several standards, for example, in 802.lln, 802.16e, 802.22, GMR-1 and Wi- 
media. In practice the block interleaver is structured but this structure is not 
always the same, so to reconstruct the interleaver in all cases we will assume 
that the interleaver has no particular structure, it is chosen randomly among 
all possible permutations. Our problem can be formalized as follows. 

Blind identification of an unknown interleaved convolutional code: state¬ 
ment of the problem, hypotheses and notations. The encoding process which 
is studied in this paper is described in Figure |T| and consists in taking an 
information word of length mk and feeding it into an (n, k ) convolutional 
encoder to produce a codeword x of length N = mn. We denote by C the set 
of codewords obtained by this convolutional code (in other words this is a 
convolutional code truncated in its first N entries). The codeword x is then 
permuted by a fixed block interleaver n of length N, we denote by y the 
interleaved codeword, y is then sent through a binary symmetric channel of 
crossover probability p. At the output of the channel we observe the noisy 
interleaved codeword 2 . 

The blind identification process consists in observing M noisy interleaved 
codewords z 1 ,, z M to recover the convolutional code C and the block in¬ 
terleaver 7T. The codeword and interleaved codeword associated to z l are 
respectively denoted by x l and y\ We also denote by the code C inter¬ 
leaved by 7r (y 1 ,..., y M belong to it). We will assume that the length N 
of the interleaver is known. It can obtained through the techniques given 
in pITl loT| and recovering this length can now be considered to be a solved 
problem. However, contrarily to [46l foO] we will make no assumption on the 
interleaver: it is chosen randomly among all permutations of size N. The 
convolutional code C is assumed to be unknown, its parameters n and k are 
also unknown. 

Our contribution. In this paper, we reconstruct the block interleaver ir 
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Figure 1: The communication scheme considered here. 


and we recover the convolutional code C. For this, we search for the dual code 
of C n . This code is recovered by known techniques [6] which are adapted to 
recover parity-check equations of low weight given noisy codewords. Once C n 
is recovered, we classify the parity-check equations that have been obtained 
for C n into groups. Each group contains parity-check equations that corre¬ 
spond to parity-check equations of C whose positions differ by a multiple of 
n. We will then focus on a specific group of equations and will be able to 
reconstruct the convolutional code and the unknown interleaver at the same 
time by introducing novel graph techniques in this setting. 

By running some experimental tests we have been able to demonstrate the 
efficiency of this method. For example, for a convolutional code with parity- 
check equations of weight 6, after finding a set of the parity-check equations, 
we reconstruct an interleaver of length 8000 in less than ten seconds. The 
time for finding the interleaver once C„ has been recovered does not depend 
on the noise level, the noise only impacts searching the parity-check equations 
of C n . ft should be added that the method used to recover these parity-check 
equations is more efficient than the methods calculating the rank of matrices, 
in particular when the data are noisy. This allows us to reconstruct efficiently 
the interleaver even for moderate noise levels. 


2 Overview of the algorithm 

Our reconstruction algorithm makes heavily use of two properties of the 
parity-check equations of an (n, k) convolutional code 

(i) for small up to moderate constraint length (which is the case of all 
convolutional codes used in practice) the parity-check equations are of 
low weight; 
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(ii) with the exception of a few parity-check equations involving the first 
bits, shifts of a parity-check equations by a multiple of n are also parity- 
check equations of the convolutional code. 

Interleaving does not destroy the first property but the second property is 
lost for C n . From now on we denote by t the “essential” minimum weight of 
parity-check equations of C (or C n ). By essential minimum weight we mean 
here that we take the minimum of the weights that appear at least a linear (in 
N ) number of times. We use this definition to discard parity-check equations 
that could involve the first bits of C and that could be of lower weight due to 
the zero initialization of the convolutional code. For the reconstruction we 
need a list C of these parity-check equations of weight t. 

Our algorithm basically works as follows 

1. We use the algorithm of [6] to find a list C of parity-check equations of 
weight t. 

2. We classify the parity-check equations of C into disjoint groups C i,..., C r 
such that two parity-check equations fall into the same group if, and 
only if, they correspond to parity-check equations of C which are shifts 
of each other by a multiple of n. 

3. Denote by C\ the group of parity-check equations of C which have 
the smallest intersection number. The intersection number of a parity- 
check equation £ is the number of parity-check equations in C which 
have at least one position in common with £. We use this group to re¬ 
cover one parity-check equation of C by graph theoretic considerations. 

4. We use this parity-check equation of C to reorder the parity-check equa¬ 
tions in C\. Note that the structure of the group C\ is such that these 
£ parity-check equations £\,... ,£g correspond to l parity-check equa¬ 
tions £{,... ,£' e of C which are shifts of a multiple of n of each other. 
This reordering is done in such a way that £[ is the shift by n{i — 1) of 
%■ 

5. This reordering of is then used in the last step to recover 7r. 
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3 Notation 


A parity check equation £ will be denoted by the set of positions it involves, 
when we write £ = {ei,..., e*} we mean here that this parity-check equation 
involves the positions {ex,. .., e t } of the code that is considered (which is 
generally clear from the context). 

With set notation applying an interleaver n to code positions really 
amounts to transform a parity-check equation £ = {ex,... ,e t } into a parity- 
check equation n(£) = (7r(ei),..., 7r(e t )}. 

4 The reconstruction algorithm in detail 

4.1 Recovering parity-check equations of weight t 

The first step consists in searching for a list C of parity-check equations of C n 
of weight t. To obtain this list we apply the algorithm of [6j. This method 
allows us to find the parity-check equations even if the observed codewords 
are noisy. 

4.2 Classifying parity-check equations into groups 

We want to classify parity-check equations of C into disjoint groups Cx, ■ ■ ■, C r 
such that the parity-check equations in a group correspond to parity-check 
equations of C that are shifts of each other by a multiple of n. We say that 
these parity-check equations are of the same type. 

Definition 1 (Type of a parity-check equation of C). £ = {ex,... ,e t } and 
£' = {e{,... ,e' t } two parity-check equations of C are of the same type if £' 
is a shift by a multiple of n of £. This means that there exists i such that 
{ex, ■ ■ ■, e t } — {ej + in ,..., e[ + in}. In such a case we write £ ~ £'. 

All parity-check equations of the same type define an equivalence class. 

Why classify 9 We need to classify parity-check equations of C because 
our method uses the regularity of parity-check equations of the convolutional 
code: shifts of parity-check equations by a multiple of n are also parity-check 
equations of the convolutional code. A convolutional code can satisfy several 
types of parity-check equations of the same weight t. 
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Example 1. The (2,1) convolutional code which satisfies £1 + £2 + £3 + £5 + 
£ 6 + £ 8 = 0 also satisfies its shifts: Vi, Xi +2i + x 2+2i + x 3+2i + x 5+2i + x e+2i = 0. 
If we add two consecutive parity-check equations, we obtain another parity- 
check equation: £i+ 2 i+£ 2 + 27 +£ 4 + 2 *+£ 6 + 2 i +£ 7 + 27 +£ 10+21 = 0. This equation 
is verified for all integers i. So this code has at least two equivalence classes 
of parity-check equations: the first is represented by £ = {1, 2,3, 5, 6, 8} and 
the second by S' = {1,2,4,6,7,10}. The weight of all these parity-check 
equations is 6. In this case, when we search for parity-check equations of 
weight 6 of C n we find equations corresponding to a mixture of these two 
types. 

If we had directly parity-check equations of C instead of parity-check 
equations of C n , then these different types of parity-check equation might get 
differentiated by their span: 

Definition 2 (Span of a parity-chek equation). Let £ = {e\,... ,e t } be a 
parity-check equation, its span s £ is defined by s £ = maxj(ej) — mirq(e 8 ) + 1. 
In an equivalence class, all parity-check equations have the same span and 
we call this quantity the span of the equivalence class. 

Once interleaving this property is lost but the equivalence classes are 
always present: 

Definition 3 (Type of a parity-check equation of C n ). Two parity-check 
equations £ and £' of C n are of the same type if 7r _1 (£) ~ 7 r -1 (£'). 

How to classify? Even if we can not use the notion of the span of parity- 
check equations to classify the equations of C n , we will use the related notion 
of neighbourhood profile: 

Definition 4 (Neighbourhood profile). Let £ G C, its neighbourhood pro¬ 
file V s is a vector of length t: V £ = (Vf ,..., V £ ) where V £ = #{£' G 
C such as \£ fl £'\ = i}. 

In other words, for a parity-check equation £, Vf is equal to the number 
of parity-check equations which have exactly i positions in common with £. 
The number of parity-check equations with at least one position in common 
with £ defines its intersection number: 

Definition 5 (Intersection number). The intersection number T £ of a parity- 
check equation £ G C is equal to Z £ = J2i<t • 
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Use profiles to determine the type of parity-check equations. The point of 
Definition [I] is that all parity-check equations of C of the same type have the 
same neighbourhood profile, whereas two equations of two different types 
have (in general) two different neighbourhood profiles. It is also the case 
after interleaving the parity-check equations of C n . 

Therefore we can classify parity-check equations into groups using their 
neighbourhood profiles. Of course, parity-check equations involving extreme 
positions of x (the first or last) do not have exactly the same neighbour¬ 
hood profile as the other parity-check equations of the same type. These 
parity-check equations have lost parity-check equations in their neighbour¬ 
hood. This motivates to bring in the following partial order on the profile of 
parity-check equations 

Definition 6 (Partial order on the profiles of parity-check equations). We 
define a partial order: V <V' if Mi < t, Vi < V[ . 

Classifying a given parity-check equation. The algorithm for classifying 
parity-check equations into groups is given by Algorithm [T] With this algo¬ 
rithm we also deduce the length n of the convolutional code C. 

Remark 1 (Choose a group). We form r groups, but we just need one of 
them. We choose a group that minimizes the intersection number of its 
parity-check equations. This is a heuristic whose rationale is that the group 
with the smallest intersection number corresponds probably to the equivalence 
class with the smallest span. Indeed, in C, the larger the span of a parity- 
check equation £ is, the more chances we have that there are parity-check 
equations with at least one position in common with £. 


Remark 2 (Deducing the length n of C). The number nb eq of parity-check 
equations in the group that we keep allows us to deduce the size n of the 
convolutional code C, n = L^y-J • This equality is due to the fact that almost 
all parity-check equations in this group correspond (after deinterleaving) to 
shifts by a multiple of n of a single parity-check equation. 

4.3 Recovering a parity-check equation of the convo¬ 
lutional code 

From now on, we assume that we have a set C\ of parity-check equations of 
C n . These parity-check equations are of weight t and in the same equivalence 
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Algorithm 1: Classifying parity-check equations and deducing n 
input: C a set of parity-check equations of C n 

output: 

• a set of parity-check equations of the same type 

• the length n of the convolutional code, 
for all £ G C do 

j V s t— the neigbourhood profiles of £ 

V El ,... ,V Er t— most frequent profiles in {V £ ,£ G C] 

C\, , C r i — 0 

for all £ G C do 

if V Ei is the unique profile such that V s < V Ei then 

| Ci <— Ci U {£} 

for all i G {1,..., r} do 

I 

Ci t— Ci with i such that I Ei = min,, X Ej 
return C\ and n 





class: they correspond to parity-checks of C which are shifts of each other by 
a multiple of n. 

We denote by £c a parity-check equation of C such that each parity-check 
equation £ of C\ satisfies 7r _1 (£) ~ £c (that is each parity-check equation £ 
of C\ is such that 7r _1 (£) is a shift of £ c ). 

The purpose of this subsection is to show how £q can be recovered from 
the knowledge of C\. £c is the parity-check equation of a sub-code of C, 
this sub-code is an (n, n — 1) convolutional code. To recover this (n, n — 1) 
convolutional code we test each (n, n — 1) convolutional code that admits 
a parity-check equation of weight t and with a span less than s max where 
s m ax is some chosen constant. Our strategy is to attach a graph to a set of 
parity-check equations such that 

(i) the equivalence class of a parity-check equation £ of an (n, n — 1) convo¬ 
lutional code discriminates the convolutional code 

(ii) if two sets of parity-check equations differ from each other by a permu¬ 
tation then their associated graphs are isomorphic. 

By checking if there is an isomorphism between the graph associated to C\ 
and the graph associated to shifts of a parity-check equation of an (n, n — 1) 
convolutional code we will recover the right convolutional code and adding 
labels to the graph will allow us to identify the permutation between the two 
sets of parity-check equations. 

Graphs associated to C\ and £ 

From now on we will use the following notation 

Notation We denote by £ the number of parity-check equations in C\. 

To the set of parity-check equations C\ we associate a labeled graph 
which is defined as follows 

Definition 7 (Graph associated to a set of parity-check equations). The 
labeled graph G(C) associated to a set C, of parity-check equations is such 
that 

• Each parity-check equation of C is represented in G{C) by a vertex. 

• If £ and £', two parity-check equations of C, have k positions in com¬ 
mon (that is \£ n£'\ = k) then in G(£) the two corresponding vertices 
are connected by k edges. 
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Figure 2: Graph Q associated to C = {£1 = {1,4, 6}, £ 2 = {2,4,5},£ 3 = 
{4, 6,7}, £ 4 = {2, 5,7}} 

• Each edge of Q(C) is labeled with the number of the position that it 
represents. 

When C is clear from the context we will just denote this graph by Q. 
Notation We denote by Q the graph Q without label on edges. 

Example 2. Let C = {£ 1 , £ 2 , £ 3 , £ 4 } with E\ = {1,4,6}, £ 2 = {2,4,5}, 
£3 = {4, 6, 7} and £4 = {2, 5, 7}. The graph Q associated to C is represented 
on Figured 

The graph G(C\) associated to C\ represents the interleaved sub-code 
of C. To recover this sub-code (not interleaved) we test each (n,n — 1) 
convolutional code. This is achieved as follows. An (n, n — 1) convolutional 
code is defined by a parity-check equation £ = {ei,..., e t }. Using £ we 
construct a set Cs of parity-check equations of this (n, n — 1) convolutional 
code: C£ = {{ei + in, e 2 + in,..., e t + in}, — - < i < -}. C£ contains £ 
consecutive parity-check equations obtained by shifts of £ by a multiple of n. 
Using Definition 0 we associate the graph G{Cg) to this set C£. To simplify 
notation we denote this graph by G £ . 

We want to compare this graph Q £ to 1 ), it is for this reason that we take 
£ parity-check equations in Cg, so the two graphs have the same number of 
vertices, and we check if they are isomorphic. 

Definition 8 (Isomorphic graphs). Two graphs Q and Q' are isomorphic if, 
and only if, there exists a bijective mapping (j) between the vertices of Q and 
the vertices of Q' so that for any pair of vertices (x,y) of Q there is the same 
number of edges between x and y as there are edges between <f(x) and <p(y) 
in Q'. 
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We we also need a finer definition of isomorphism which is suitable for 
labeled graphs 

Definition 9 (Equivalent graphs). Two labeled graphs Q and Q' are equiv¬ 
alent if they are isomorphic (call the corresponding mapping <f>) and there 
exists a bijective mapping if of the labels from one graph to the other so that 
for any pair of vertices (x, y) of Q if we denote by {ai,..., a s } the (multi)set 
of labels of the edges between x and y, then the edges between (f{x) and <f(y) 
in Q' have labels {^(oi),..., if(a s )}. 


To recover the parity-check equation Sc of the sub-code of C and the 
interleaver n we use the following proposition 

Proposition 10. If £ = Sc then Q £ and G{£i) are equivalent. 

Proof. We assume that S = Sc- Let C£ = {So, ■ ■ ■, £e-i} be the set of 
parity-check equations associated to S, with Si being equal to £*_i shifted 
by n. S — Sc so C\ contains the same parity-check equations as Cs but 
interleaved by the interleaver n: C\ = {7r(£ 0 ), tt(£i), • • •, 7t(£^_i)}. Note 
that the interleaver changes the numbering of positions, not the number of 
positions in common between two parity-check equations. An isomorphism 
(f between vertices of this graph is given by (f : Q £ —> G(C i), Si (->• n(Si), 
\/i < £ — 1. This shows that that Q £ and G(Ci) are isomorphic. If we 
denote by mr the minimal value such that Q £ contains a vertex representing 
a parity-check equation involving the position rrr, the block interleaver n 
gives an isomorphism if on labels between Q £ and G n . if : Q £ —\ GlyCi), 
i (->• 7 r(z — m £ ). We obtain that Q £ and Q(C i) are equivalent. □ 

Remark 3. If we find a parity-check equation £ such that G(C\) and Q £ are 
equivalent, then an isomorphism if between labels of these graphs gives the 
block interleaver II such that 11(C) = C w . 

Sub-graphs associated to C\ and S 

To check the equivalence we will need auxiliary graphs which are much 
smaller and that will in general be sufficient for testing the equivalence be¬ 
tween graphs. More precisely, we use sub-graphs induced by Q £ and Q(C i). 
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Notation From now on to simplify notation we will denote the graph 
£(A) by g\ 

We will associate six sub-graphs, Gf, Gf , Gf , Gf , an d to and £ 
such that if G £ and G n are equivalent then: 

• G\ and G\ are isomorphic 

• G £ and Gf are isomorphic 

• G% and Gf are equivalent 

The first graphs G f and Gf are not labeled and represent the neighbour¬ 
hood of a parity-check equation. 

To obtain GT, we randomly choose a parity-check equation So in C\. Gf 
is the sub-graph of G 71 induced by the vertex representing So and all vertices 
having at least one edge in common with it. 

Gf is a sub-graph of G s induced by vertices representing S and all its shifts 
by a multiple of n such that they have at least one position in common with 
S. This graph contains only a small number of vertices as shown by 

Proposition 11. Let S be a parity-check equation of an {n,n — 1) convolu¬ 
tional code and s £ the span of S. The sub-graph G f associated to S contains 
£ 

at most 2 \—] — 1 vertices. (In other words, the parity-check equation S has 
£ 

at most 2[^-] — 1 parity-check equations in its neighbourhood.) 

Notation We denote by £b) the parity-check equation equals to S shifted 
by in. 

Proof. Assume that the parity-check equation S is given by S = {e\,... ,e t } 
with e\ < e 2 < ■ ■ ■ < e t . s £ is the span of S, so s £ = e t — e 1 + 1. £b) 
is represented in G f if and only if £b) and S have at least one position in 
common, that is {ei,..., e t } fl {ei + in, ..., e t + in} ^ 0. 

For i > 0 we have {ei,..., e t } fl {ei +in ,..., e t + in} ^ 0 if e t > e\ + in, that 
is 0 < * < —. So for i > 0, there are at most ("—] parity-check equations 
which can have positions in commons with S. 

If i < 0. {ei,..., e t } fl {ei + in,... ,e t + in} ^ 0 if e t + in > e\, that is 
— — < i < 0. For i < 0, there is at most [—] — 1 parity-check equations 
which can have positions in commons with S. 
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So, the maximal number of parity-check equations which can have a position 
in common with £ is equal to 2\— ] — 1. □ 

Proposition 12. If 7t _1 (£ 0 ) does not involve the first or last positions, and 
if £ = £c then Gf and Gf are isomorphic. 

Proof. £ = £c (that is 7r“ 1 (^ 0 ) ~ £), we denote by £' the parity-check 
equation £^ such that 7r -1 (£ 0 ) = £^ ■ If 7 t _ 1 (£ 0 ) does not involve the first 
or last positions and if we denote by I = {ii,...,ij} the set of integers 
such as Vf G I, £' and £ f l ^ have at least one position in common, then 
C\ contains i t (£'^),... , 7 r(£ ,( W). All these parity-check equations have at 
least one position in common with £ 0 , so they are represented in Gf and 
we have an isomorphism between G\ and Gf defined by : Gf —> Gf, 
£« ^ n(£^). □ 

So the hrst step to test a given (n,n — 1) convolutional code, consists 
in checking if Q f and Gf are isomorphic. If Q f and Gf are not isomorphic 
then £ £q. But these graphs are not enough discriminating, two (n, n — 1) 

convolutional codes, defined by £ and £’ can be associated to two isomorphic 
graphs Q f and Q f . 

Example 3. For n = 2, the graph Qf associated to the parity-check equation 
£ = {1,2, 4, 6, 7} is isomorphic to the graph Q f associated to the parity- 
check equation £’ = {1, 3,4, 6, 7}. These graphs are represented on Figured 
An isomorphism between these graphs is defined by : Gf — » Q f , £h) (_)• £ ,( F>, 

We associate to C\ and £ two other graphs Qf and ■ These graphs are 
not labeled and represent the neighbourhood at distance two of a parity-check 
equation. 

Qf is the sub-graph of G n induced by G\ and all vertices having at least 
one edge in common with a vertex of G\ ■ So Qf represents the neighbour¬ 
hood at distance 2 of £ 0 in C\. 

G% is the sub-graph of Q £ induced by G f and all vertices having at least one 
edge in common with a vertex of G\ ■ This graph is rather small too as shown 
by: 


Proposition 13. Let £ be a parity-check equation of an (n,n — 1) convo¬ 
lutional code and s £ be the span of £. The sub-graph G £ associated to £ 
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Figure 3: Graphs Qf and Qf with n = 2, £ = {1,2,4,6,7} and £' = 
{1,3,4, 6, 7}. 

g 

contains at most 4{—] — 3 vertices. (In other words, the parity-check equa- 
tion £ has at most 4|~^-] — 3 parity-check equations in its neighbourhood at 
distance 2.) 

Proposition 14. //7 t _1 (£o) does not involve the first or last positions, and 
if £ = £c then Qf, and Qf are isomorphic. 

Proof. £ = £c (that is 7r -1 (£ 0 ) ~ £), we denote by £' the parity-check equa¬ 
tion £^ such that 7r _1 (£ 0 ) = £^ k \ If 7r -1 (£ 0 ) does not involve the hrst or 
last positions and if we denote by / = the set of integers such 

that for all i in /, contains a vertex representing £^\ then C\ contains 
7r(£7^d),. . . , 7 1 {£'^ 1 ^). All these parity-check equations are in the neighbour¬ 
hood at distance 2 of £ 0 , so they are represented in Qf and we have an iso¬ 
morphism between and Qf defined by 4> : Qf, —> Qf, (->• 7t(£W). □ 

The second step to test a given (n, n — 1) convolutional code, consists in 
checking if C/f and Qf are isomorphic but these graphs are not sufficiently 
discriminating. Finally we use two small labeled graphs Qf, and Qf. 

To obtain and Qf we just add label on edges of Q f and Qf. 

Proposition 15. If 7t -1 (£ 0 ) does not involve the first or last positions, and 
if £ = £c then Q f and Qf are equivalent. 
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Proof. With Proposition [IH we deduce that Qf and Gf are equivalent. If we 
denote by m £ and m n the minimal values such that Q £ and Qf contain a 
vertex representing a parity-check equation involving respectively positions 
m £ and m n , then we have an isomorphism if between labels of Q £ and Qf : 
if : Q £ —>• Q21 i ^ — m£ + 7 r _1 (m 7r )). □ 

Finally the algorithm used for recovering the parity-check equation £q of 
the sub-code of C is Algorithm [2j 


Algorithm 2: Recovering the parity-check equation £c 
input: C\ a set of parity-check equations of weight t of the same type 
and n the length of C 

output: L a list of parity-check equations such that £c can be equal 
to each of them 

L <- 0 

£ 0 <— choose at random a parity-check equation of 

Gf,G2 an d G£ t— sub-graphs induced by £ 0 and its neighbourhood 

for all £ of weight t and with a span less than s max do 

Q f t— graph representing the neighbourhood of £ at distance 1 
if Q £ and Gf are isomorphic then 

Q £ graph representing the neighbourhood at distance 2 of £ 
if Q £ and Gf are isomorphic then 

G £ <r- labeled graph representing the neighbourhood at 
distance 2 of £ 

if G 2 an d Gf are equivalent then 
| L^LU{£} 

return L 


Reducing the number of tests 

In fact, these graphs have lots of symmetries, and we do not really need to 
test all parity-check equations of weight t and with a span less than s ma x- 
The following proposition allows us to reduce the number of (n,n — 1) con¬ 
volutional code that we have to test. 


Proposition 16. Let £ = {ei,...,et} be the parity-check equation of an 
(n, n — 1) convolutional code and Go be the labelled graph representing the 
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neighbourhood at distance two of £. 

£ can also be represented by a binary vector b\... b s where b t — 1 if i £ 

{d,.. .,e t }. 

• The graph Q £ ' associated to £' represented by the binary vector b s ... bi 
is equivalent to Q £ . 

• For all permutations p — (pi,... ,p n ] of length n, the graph Q £ ' associ¬ 
ated to the parity-check equation £' represented by the binary vector 
p(bi... b n )p(b n+ 1 ... b 2n ) • • .p{b s - n+ 1 ... b s ) is equivalent to . 

Proof. • For the first point, if Q £ contains £^ n \ ..., then Q £ ' contains 
£'( ll \ ..., £'^\ and between these two graphs we have the isomorphism 
0 defined by 0 : Q £ —> Q 2 , £^ for all i G {i\,...,ij}. 

The isomorphism ip between labels of these graphs can be defined by 
■0 : —5 > G 2 j k m £ — k where m £ is the maximal value of labels 

of Q £ . With these two isomorphisms we deduce that Q £ and Q £ are 
equivalent. 

• For the second point, if Q £ contains £^ ll \ ..., £^ then Q £ ' contains 

£'( ll \ ..., £d l if and between these two graphs we have the isomorphism 
0 defined by 0 : Q £ —> Q £ ', S® i-)- for all i e {ii,..., ij}. We define 
the permutation P by P(i) = p(i mod n) + . The isomorphism 0 

on labels defined by 0 : Q £ —t Q £ ', k i —y P(k) allows us to deduce that 
Q £ and Q £ are equivalent. 

□ 

Definition 17 (Equivalent parity-check equations). Let £ and £' be two 
parity-check equations, if using the Proposition \T5\ we can deduce that the 
two graphs Q £ and Q £ are equivalent we say that £ and £' are equivalent. 

Example 4. For n = 2, if s max = 20 and t = 10 we run only 15 328 tests 
instead of 184 756. If we suppose that s max = 30, 1238 380 tests are needed 
instead of 30 045 015. 

If the sought parity-check equation is £q = {1,2,3,5,6,7,8,12,13,14} (of 
weight 10), only 2 parity-check equations produce an isomorphic graph to Qf 
and among them one is equivalent to Qf for s max = 20 (testing the 15 328 
parity-check equations takes approximately 1 second). If we take s max = 30, 
4 graphs are isomorphic to Qf and 2 are equivalent to Qf, (one of them is 
eliminated later) these tests take less than 3 minutes. 
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If the parity-check equation Sc is not recovered 

If no parity-check equation has an equivalent labeled graph with Qf we may 
have chosen in C\ a parity-check equation So which has incomplete graphs 
(after deinterleaving this parity-check equation involves the first or last posi¬ 
tions of x or at least a parity-check equation in its neighbourhood at distance 
2 is missing in C\). 

In this case, we randomly choose another parity-check equation in £i, we 
compute the new graphs Qf, Qf and Qf representing its neighbourhood at 
distance one and two, and we test all convolutional codes. 


Remark 4. If Qf or Qf is incomplete, it is probably not symmetric so, no 
graph Q f or Qf can be isomorphic with it and the test of all ( n , n — 1) con¬ 
volutional codes is very quickly (we just compare the number of vertices, they 
have not the same number so they can’t be isomorphic). 


If Sq can be equal to several parity-check equations S we apply the end 
of the method for each of them. 

4.4 Ordering parity-check equations 

Using Sc the parity-check equation previously recovered, we want to order 
the parity-check equations of C\. That is, find an ordering A = S ai ,..., S a , 
of these parity-check equations such that 7r _1 (£ ai+1 ) is equal to the shift by n 
of 7T — 1 (£ ai ). All parity-check equations of C\ belong to A once and only once. 

To order these parity-check equations we extend the two graphs Qf and 
Qo c and we search for an isomorphism between the vertices of these two ex¬ 
tended graphs. This isomorphism give us the ordering A. 

When we recover the parity-check equation Sc of the sub-code of C we 
search for an isomorphism between Qf and Qjf. Once we know this isomor¬ 
phism, we also have the bijection f> between the vertices of these graphs. 
This bijection gives us a part of the ordering. Indeed, for all i, such that Qj c 
contains a vertex V, representing Sc shifted by in , we place the parity-check 
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equation represented by 0(Vj) at position i in A. 

To obtain the bijection using all parity-check equations of C\ and deduce 
the entire ordering A, we extend step by step Qf , G^f an d the bijection 0. 
We denote by Q^ c b the graph representing £ shifted by in for all integers 
i G \a. b], 0„ the isomorphism dehned for all integers between a and b, and 
Gi b the graph <S>{Q* h ). 

A step of the extension. Knowing Gf b and 0 a ..6, we search for Gf I c b + 1 > 
Ga..b +1 and 0a..6+i- 

• To obtain G^.b+i from Q £ ff b we just add a vertex representing £q shifted 
by (b + l)n and the corresponding edges. 

• We search in C\ for a parity-check equation £, which is not represented 
in Qf b and such that if we add a vertex representing this parity-check 
equation and the corresponding edges to Gf b > then <f a ..b +1 defined by 
0a.. 6+1 (.7) = f>a..b(j) for j e [a,..., b] and f a .. b+ i(b+l) = i is an isomor¬ 
phism between G^ c b+l and Gf, b extended with £ t . 

When we can not extend GZ.. b and 0a..6 such that £/f c b+1 and GZ..b+i 
are isomorphic, we extend these graphs and the isomorphism in the other 
direction. In other words, we search for G^-i b > GZ -1 6 and 0a-i..6 from ^f c fe , 

Gl. b and 0a..6* 

Remark 5 (Several parity-check equations). If at a given step, several parity- 
check equations £* of C\ can be chosen, then we extend the two graphs and 
the isomorphism with feedback and finally we choose the biggest isomorphism 
and corresponding graphs. 


Remark 6 (No parity-check equation). If no parity-check equation in 
satisfies all conditions, it might be that the sought parity-check equation is not 
in C\. This parity-check equation was not found using m or not classified in 
this group (it is an unclassified parity-check equation). In this case, we add 
a ”missing parity-check equation” to Qf b , that is we add a vertex and edges 
to respect the regularity of Qf b and we define f> a ..b+i(b + 1) = ” missing ” or 
0a—i..b(a — 1) = "missing". Then we continue the extension of graphs and 
0 . 
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Figure 5: Q^..b+i contains a missing parity-check equation 



Figure 6: Continue to extend the graph 


£ 

Remark 7. We do not add more than ("—] —1 consecutive ”missing” parity- 
check equations in A because in this case, the next parity-check equation has 
no position in common with previous parity-check equations. 

Example 5. We represent on Figures [^J 0 and\B the extension with a missing 
parity-check equation (in red). There is an edge connecting a vertex lying 
before the missing parity-check equation to a vertex lying after this missing 
parity-check equation. 

At the end we recover the isomorphism between Q £ and Q n . Indeed, at 
the end of the extension, Q £ b is equal to Q £ and Qf b to Q* probably with 
additional vertices representing missing parity-check equations. 

4.5 Reconstructing the interleaver 

Now we have the isomorphism between Q £c and () 7T , so to reconstruct the 
interleaver we need to recover the isomorphism between labels on edges of 
Q £c and Q*. 

We recover ifj step by step, at each step we search for a sub-graph of Q £c 
which has a label i appearing only once, or appearing a different number of 
times than the other labels. The label of the image by f> of this edge labeled i 
gives us f>{i). Then we remove all edges labeled by i in Q £c and by ip(i) in (j £n . 
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At the end of this extension, we extend 0 with positions which do not 
appear on graphs but are involved in parity-check equations represented by 
these graphs. 

0 defines the interleaver 7r, indeed 7r(z) = 0(i + mn £c ) where m £c is the 
minimal value such that Q £c contains a vertex representing a parity-check 
equation involving the position m £c . 


Remark 8 (Several isomorphisms). Depending on £q there might be several 
bijections between labels of the two graphs. In this case we have several inter¬ 
leavers. For these interleavers only the first and last positions are different. 
The number of interleavers just depends on £c ond not on the length of it. 


Example 6. The size of the interleaver is N = 26. The two graphs Q £c 
and Q 77 are represented on Figures \7\ and\8) 0 is defined by 0 : Q £c —* Q 77 , 


£q £ 5 , £( 1 H > £q, £q 1 t —y £%, .. ., £(, 1 1 —> £ 2 . 

If we take the sub-graph ofQ £c induced by £^ and £'f L> , then we deduce 

that -0(3) = 1 because the label 3 appears tree times and no other label appears 
tree times in this sub-graph. Then we also deduce that 0(1) = 3, 0(4) = 25 
and 0(5) = 17. With other sub-graphs we obtain the isomorphism 0 defined 
by 0 : Q £c —>■ Q 7 


0 - 3 ) 


(- 2 ) 


.( 6 ) 


( 1 ) 


-5 26, 

-3 12, 

-2 8 , 

-1 5, 


0 20 , 

1 3, 

2 15, 

3 1 —>- 1, 


4^ 25, 

5 17, 

6 23, 

7 1 —> 6, 


8 13, 

9 11, 

10 7, 

11 16, 


12 19, 

13 2, 

14 ^ 21 , 

15 10. 


With this bijection we deduce a part of the interleaver it : 

TT = [...,26, 7,12,8,5,20,3,15,1,25,17, 23,6,13,11,7,16,19,2,21,10,...] 

On the graphs we do not represent the positions involved in a single parity- 
check equation, we do not have edges with the corresponding label. But we 
know these values and we use them to determine the first and last positions 
of it. For example, the second parity-check equation represented in Q £c is 

/_o\ 

£ c ' = {—5, —4, —3, —1, 0} and the image by 0 of this parity-check equa¬ 
tion is £ 6 = {5,12,18,20,26}. With this parity-check equation, we extend 
the bijection with —4 i-» 18 (the only two unused values in these parity-check 
equations). With the same reasoning we deduce that 16 1 —>- 4. The first parity- 
check equation represented on Q £c is = {—7, —6, —5, —3, —2} and the 
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Figure 7: The graph 

25 13 



Figure 8: The graph Q\ 


corresponding equation on Q n is £ 5 = {8, 9,12,14, 26}, with these parity-check 
equations we deduce that —7 i-> 9 and —6 (->• 14 or —7 n- 14 and —6 n- 9, we 
have the same indeterminate for the two last positions. So we have 4 possible 
interleavers: 

7r = [14, 9,26,18,12, 8, 5, 20, 3,15,1, 25,17,23, 6,13,11, 7,16,19, 2, 21,10,4, 22, 24] 
or ti= [9,14, 26,18,12, 8, 5, 20, 3,15,1, 25,17,23, 6,13,11, 7,16,19, 2, 21,10,4, 22, 24] 

or 77= [14, 9, 26,18,12, 8, 5, 20, 3,15,1, 25,17,23, 6,13,11, 7,16,19, 2, 21,10,4, 24, 22] 

or 77= [9,14, 26,18,12, 8, 5, 20, 3,15,1, 25,17,23, 6,13,11, 7,16,19, 2, 21,10,4, 24, 22] 

Moreover we can also take the mirror of the isomorphism <f>, and we obtain 

4 new possible interleavers. 


4.6 Particular cases 

Indeterminate positions. If the reconstructed interleaver contains indetermi¬ 
nate positions, we search for these positions using noisy interleaved code¬ 
words. At each indeterminate positions we test all possible values. To test 
a position we reconstruct the missing parity-check equations and we verify 
the number of noisy interleaved codewords that satisfy these parity-check 
equations. If this number is less than a threshold (we can take M) 
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Figure 9: The graph f/f 


it is not the correct value for this position. 

Example 7. The graph associated to is on Figure [?| and the graph Qf 
on Figure 0 One bijection on edges is the same as in Example 0 but we 
can not know if 7 i —> 6 and 8 i—>• 13 or 7 n- 13 and 8 4 6. To determine 
the right bijection we reconstruct the missing parity-check equation and we 
test them with noisy interleaved codewords. We test 7 n- 6 and 8 H > 13, in 
this case the missing equation is {6,7,11,17,23}, then we test 7 i— > 13 and 
8 h-* 6, the missing equation is {7,11,13,17, 23}. With the number of noisy 
interleaved codewords that satisfy these equations we deduce the 8 possible 
interleavers as in Example 0 

Not the right length. If the reconstructed interleaver has not the right 
length, it is the case when C\ does not contain all parity-check equations 
of the same type, the missing parity-check equations are not classified. To 
recover the beginning and the end of the interleaver we continue the recon¬ 
struction by applying the same steps using unclassified parity-check equa¬ 
tions: we extend the graphs Qf b and Q^ c b then we label these graphs and 
deduce the entire interleaver. 


5 Experimental results 

We have run several experimental tests for different convolutional codes C 
and interleaver sizes N. 

In the first test we used the convolutional code defined by the generator ma¬ 
trix in polynomial form C 1 = (1 + D + D 2 + D 5 , 1 + D + D 3 + D 4 + D 6 ). 
This code satisfies one parity-check equation of weight 8. With a set of in¬ 
terleaved codewords we search for parity-check equations of weight 8 of C\ 
using a slightly improved method of [6] (we give in Table 0 the number M of 


22 




















N 

running time (in seconds) 

C 1 C 2 C 3 

1000 

5 

0.2 

5 

2 000 

6 

0.7 

10 

5 000 

7 

4 

60 

8 000 

11 

10 

130 

10 000 

12 

15 

185 


Table 1: Running time for C 1 = (1 + D + D 2 + D 5 ,1 + D + D 3 + D 4 + D 6 ), 
C 2 = (1 + D + D 2 ,1 + D 2 + D 3 ) and C 3 = (1 + D 2 + D 3 + D 5 + D 6 ,1 + D + 
D 2 + D 3 + D 6 ) 


codewords that we use and the running time for recovering all parity-check 
equations), then we applied our method to reconstruct the interleaver and 
the convolutional code. For these tests we assumed that s max = 25, and we 
give the running time in Table [0 

In the next test, the convolutional code was C 2 = (1 + D + D 2 ,1 + D 2 + D 3 ). 
This code has 5 types of parity-check equation of weight 6. To test our 
method with C 2 we assumed that s max = 10, the running times are also in 
Table |T] and [2l 

The last test was with the the convolutional code defined by C 3 = (1 + 

D 2 + D 3 + D 5 + D 6 ,1 + D + D 2 + D 3 + D 6 ). This code satisfies 11 types of 
parity-check equations of weight 10. To reconstruct the interleaver and the 
convolutional code we assumed s max = 20, see Table Q] and [2] 

In all cases, the interleaver and the convolutional code were reconstructed 
efficiently. To obtain these running times we used all parity-check equations 
of weight 8, 6 or 10. Recovering all parity-check equations of low weight 
may take time, but our method can be applied without having all parity- 
check equations. For example, with the first convolutional code C 1 = (l + 

D + D 2 + H 5 ,1 + D + D 3 + D A + D 6 ), we note in Table [3] the running 
time for reconstructing the interleaver and the convolutional code in case 
we have less than 100% of parity-check equations of weight 8. We can see 
that, for small lengths the time increases rapidly if we do not have all parity- 
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N 

M 

C 1 

run. 

time 

M 

C 2 

run. 

time 

M 

C 3 

run. 

time 

1000 

400 

60 

200 

3 

400 

300 

2 000 

500 

60 

400 

8 

600 

600 

5 000 

1400 

600 

900 

45 

1600 

2000 

8 000 

2000 

1800 

1300 

240 

2400 

3000 

10 000 

2600 

2700 

1700 

300 

2800 

9000 


Table 2: Running time in seconds for recovering all parity- 

check equations C 1 = (1 + D + D 2 + Z> 5 ,1 + D + D 3 + D 4 + D 6 ), 
C 2 = (1 + D + D 2 ,1 + D 2 + D 3 ) and C 3 = (1 + D 2 + D 3 + D 5 + D 6 , 1 + D + 
D 2 + D 3 + D 6 ) without noise 


check equations, but for large lengths having all parity-check equations is 
not necessary to reconstruct to reconstruct efficiently the interleaver and the 
convolutional code. 

We also test with noisy interleaver codewords, for the convolutional code 
C 2 and the binary symmetric channel of crossover probability p = 0.001 and 
p = 0.01, we note in table [4] the running time to recover almost all parity- 
check equations (more than 96% of them). These times are long but by 
parallelizing, the running time is divided by as much as executed programs. 
The running time to reconstruct the convolutional code and the interleaver 
is the same as in noiseless case. 


6 Conclusion 

This paper shows that when an interleaved convolutional code is used, then 
it can be efficiently reconstructed from the knowledge of a few hundred (or 
thousand) observed noisy codewords in the case of moderate noise of the 
channel by first recovering low-weight codewords in the dual of the interleaved 
convolutional code and then using this set of dual codewords to recover the 
convolutional structure and the interleaver. This assumption of moderate 
noise can be removed when the length N of the interleaver is sufficiently 
short (say below a few hundred) and is needed to ensure that most low-weight 
codewords are obtained by the slightly improved Cluzeau-Finiasz method [6] 
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N 

% of parity-check 
equations 

running time 
(in seconds) 

1000 

100 

5 


99 

7 


96 

37 


93 

110 

2 000 

100 

6 


95 

13 


93 

155 

5 000 

100 

7 


95 

78 

8 000 

100 

11 


97 

21 

10 000 

100 

12 


94 

63 


Table 3: Running time for C 1 = (1 + D + D 2 + D 5 ,1 + D + D 3 + D 4 + D 6 ) 


N 

P = 
M 

0.001 

runn. 

time 

P 

M 

= 0.01 

runn. 

time 

100 

100 

1 

100 

10 

200 

100 

3 

100 

240 

500 

300 

30 

200 

4 000 

1000 

400 

360 

200 

72 000 

2 000 

600 

16 000 




Table 4: Running time in seconds for recovering all parity-check equations 
when C = (1 + D + D 2 ,1 + D 2 + Z> 3 ) and with a crossover probability p 
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we used in our tests. Once these parity-check equations are recovered, a graph 
representing how these parity-check equations intersect is used to recover at 
the same time the interleaver and the convolutional code. This method is 
really fast, for instance the second phase took less than a few minutes in 
all our experiments and this even for very long interleavers (up to length 
N = 10000). This method applies to any convolutional code, it just needs 
convolutional codes that have reasonably low-weight and low-span codewords 
in the dual of the convolutional code, which is the case for virtually all 
convolutional codes used in practice. 
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